Dataset statistics
| Number of variables | 48 |
|---|---|
| Number of observations | 4272 |
| Missing cells | 100400 |
| Missing cells (%) | 49.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.6 MiB |
| Average record size in memory | 384.0 B |
Variable types
| Numeric | 7 |
|---|---|
| Unsupported | 2 |
| Categorical | 39 |
uf_de_nascimento_do_paciente has constant value "" | Constant |
uf_de_residencia_do_paciente has constant value "" | Constant |
historia_familiar_de_cancer_relacionado_a_sindrome_de_cancer_de_mama_e_ovario_hereditaria_choice_sim_2o_grau_apenas_1_caso has constant value "" | Constant |
historia_familiar_de_cancer_relacionado_a_sindrome_de_cancer_de_mama_e_ovario_hereditaria_choice_sim_2o_grau_mais_de_1_caso has constant value "" | Constant |
qual_metodo_choice_diu has constant value "" | Constant |
qual_metodo_choice_camisinha has constant value "" | Constant |
qual_metodo_choice_outros has constant value "" | Constant |
qual_metodo_choice_nao_informou has constant value "" | Constant |
hormonioterapia has constant value "" | Constant |
radioterapia has constant value "" | Constant |
data_da_ultima_informacao_sobre_o_paciente has a high cardinality: 2131 distinct values | High cardinality |
data_da_cirurgia has a high cardinality: 1653 distinct values | High cardinality |
data_de_inicio_do_tratamento_quimioterapia has a high cardinality: 1766 distinct values | High cardinality |
data_de_inicio_da_radioterapia has a high cardinality: 1708 distinct values | High cardinality |
sexo is highly imbalanced (92.9%) | Imbalance |
ja_ficou_gravida is highly imbalanced (91.4%) | Imbalance |
historia_familiar_de_cancer_relacionado_a_sindrome_de_cancer_de_mama_e_ovario_hereditaria_choice_nao is highly imbalanced (99.2%) | Imbalance |
historia_familiar_de_cancer_relacionado_a_sindrome_de_cancer_de_mama_e_ovario_hereditaria_choice_sim_1o_grau_apenas_1_caso is highly imbalanced (92.3%) | Imbalance |
historia_familiar_de_cancer_relacionado_a_sindrome_de_cancer_de_mama_e_ovario_hereditaria_choice_sim_1o_grau_mais_de_1_caso is highly imbalanced (98.0%) | Imbalance |
qual_metodo_choice_pilula_anticoncepcional is highly imbalanced (99.7%) | Imbalance |
ja_fez_uso_de_drogas is highly imbalanced (94.2%) | Imbalance |
consumo_de_alcool is highly imbalanced (51.1%) | Imbalance |
grau_de_parentesco_de_familiar_com_cancer_choice_primeiro_pais_irmaos_filhos is highly imbalanced (85.4%) | Imbalance |
grau_de_parentesco_de_familiar_com_cancer_choice_segundo_avos_tios_e_netos is highly imbalanced (87.9%) | Imbalance |
grau_de_parentesco_de_familiar_com_cancer_choice_terceiro_bisavos_tio_avos_primos_sobrinhos is highly imbalanced (91.1%) | Imbalance |
tipo_de_terapia_anti_her2_neoadjuvante is highly imbalanced (96.6%) | Imbalance |
repeat_instrument has 4272 (100.0%) missing values | Missing |
repeat_instance has 4272 (100.0%) missing values | Missing |
escolaridade has 215 (5.0%) missing values | Missing |
idade_do_paciente_ao_primeiro_diagnostico has 180 (4.2%) missing values | Missing |
sexo has 147 (3.4%) missing values | Missing |
raca_declarada_biobanco has 4038 (94.5%) missing values | Missing |
uf_de_nascimento_do_paciente has 4270 (> 99.9%) missing values | Missing |
uf_de_residencia_do_paciente has 4270 (> 99.9%) missing values | Missing |
ja_ficou_gravida has 3259 (76.3%) missing values | Missing |
quantas_vezes_ficou_gravida has 4228 (99.0%) missing values | Missing |
numero_de_partos has 4270 (> 99.9%) missing values | Missing |
idade_na_primeira_gestacao has 3375 (79.0%) missing values | Missing |
abortou has 4220 (98.8%) missing values | Missing |
amamentou_na_primeira_gestacao has 3230 (75.6%) missing values | Missing |
por_quanto_tempo_amamentou has 3584 (83.9%) missing values | Missing |
idade_da_primeira_mentruacao has 3247 (76.0%) missing values | Missing |
faz_uso_de_metodos_contraceptivo has 4269 (99.9%) missing values | Missing |
ja_fez_uso_de_drogas has 4123 (96.5%) missing values | Missing |
atividade_fisica has 3967 (92.9%) missing values | Missing |
consumo_de_tabaco has 4060 (95.0%) missing values | Missing |
consumo_de_alcool has 4068 (95.2%) missing values | Missing |
possui_historico_familiar_de_cancer has 4082 (95.6%) missing values | Missing |
regime_de_tratamento has 1409 (33.0%) missing values | Missing |
hormonioterapia has 4269 (99.9%) missing values | Missing |
data_da_cirurgia has 2056 (48.1%) missing values | Missing |
tipo_de_terapia_anti_her2_neoadjuvante has 3138 (73.5%) missing values | Missing |
radioterapia has 1947 (45.6%) missing values | Missing |
data_de_inicio_do_tratamento_quimioterapia has 1450 (33.9%) missing values | Missing |
esquema_de_hormonioterapia has 4260 (99.7%) missing values | Missing |
data_do_inicio_hormonioterapia_adjuvante has 4270 (> 99.9%) missing values | Missing |
data_de_inicio_da_radioterapia has 1949 (45.6%) missing values | Missing |
data_da_ultima_informacao_sobre_o_paciente is uniformly distributed | Uniform |
numero_de_partos is uniformly distributed | Uniform |
data_da_cirurgia is uniformly distributed | Uniform |
data_de_inicio_do_tratamento_quimioterapia is uniformly distributed | Uniform |
data_do_inicio_hormonioterapia_adjuvante is uniformly distributed | Uniform |
data_de_inicio_da_radioterapia is uniformly distributed | Uniform |
record_id has unique values | Unique |
repeat_instrument is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
repeat_instance is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
| Analysis started | 2023-02-28 14:18:48.620080 |
|---|---|
| Analysis finished | 2023-02-28 14:19:08.409659 |
| Duration | 19.79 seconds |
| Software version | ydata-profiling vv4.0.0 |
| Download configuration | config.json |
record_id
Real number (ℝ)
| Distinct | 4272 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48652.36 |
| Minimum | 302 |
|---|---|
| Maximum | 82240 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 302 |
|---|---|
| 5-th percentile | 13992.4 |
| Q1 | 31013 |
| median | 53394 |
| Q3 | 65816.75 |
| 95-th percentile | 78668.25 |
| Maximum | 82240 |
| Range | 81938 |
| Interquartile range (IQR) | 34803.75 |
Descriptive statistics
| Standard deviation | 20659.52 |
|---|---|
| Coefficient of variation (CV) | 0.4246355 |
| Kurtosis | -0.99374558 |
| Mean | 48652.36 |
| Median Absolute Deviation (MAD) | 16732 |
| Skewness | -0.29501895 |
| Sum | 2.0784288 × 108 |
| Variance | 4.2681575 × 108 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 302 | 1 | < 0.1% |
| 60912 | 1 | < 0.1% |
| 60757 | 1 | < 0.1% |
| 60774 | 1 | < 0.1% |
| 60777 | 1 | < 0.1% |
| 60799 | 1 | < 0.1% |
| 60815 | 1 | < 0.1% |
| 60825 | 1 | < 0.1% |
| 60826 | 1 | < 0.1% |
| 60840 | 1 | < 0.1% |
| Other values (4262) | 4262 |
| Value | Count | Frequency (%) |
| 302 | 1 | |
| 710 | 1 | |
| 752 | 1 | |
| 1367 | 1 | |
| 1589 | 1 | |
| 1705 | 1 | |
| 1843 | 1 | |
| 1873 | 1 | |
| 1898 | 1 | |
| 1960 | 1 |
| Value | Count | Frequency (%) |
| 82240 | 1 | |
| 82205 | 1 | |
| 82131 | 1 | |
| 82124 | 1 | |
| 82123 | 1 | |
| 82122 | 1 | |
| 82118 | 1 | |
| 82112 | 1 | |
| 82111 | 1 | |
| 82100 | 1 |
repeat_instrument
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 4272 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 33.5 KiB |
repeat_instance
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 4272 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 33.5 KiB |
escolaridade
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 215 |
| Missing (%) | 5.0% |
| Memory size | 33.5 KiB |
| IGNORADA | |
|---|---|
| ENSINO MÉDIO | |
| ENS. FUNDAMENTAL INCOMPLETO | |
| ENS. FUNDAMENTAL COMPLETO | |
| SUPERIOR | 174 |
Length
| Max length | 27 |
|---|---|
| Median length | 8 |
| Mean length | 12.089721 |
| Min length | 8 |
Characters and Unicode
| Total characters | 49048 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ENS. FUNDAMENTAL INCOMPLETO |
|---|---|
| 2nd row | ENSINO MÉDIO |
| 3rd row | ENS. FUNDAMENTAL INCOMPLETO |
| 4th row | ENS. FUNDAMENTAL INCOMPLETO |
| 5th row | ENS. FUNDAMENTAL COMPLETO |
Common Values
| Value | Count | Frequency (%) |
| IGNORADA | 2535 | |
| ENSINO MÉDIO | 488 | 11.4% |
| ENS. FUNDAMENTAL INCOMPLETO | 445 | 10.4% |
| ENS. FUNDAMENTAL COMPLETO | 357 | 8.4% |
| SUPERIOR | 174 | 4.1% |
| ANALFABETO | 58 | 1.4% |
| (Missing) | 215 | 5.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| ignorada | 2535 | |
| ens | 802 | 13.0% |
| fundamental | 802 | 13.0% |
| ensino | 488 | 7.9% |
| médio | 488 | 7.9% |
| incompleto | 445 | 7.2% |
| completo | 357 | 5.8% |
| superior | 174 | 2.8% |
| analfabeto | 58 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 6848 | |
| N | 6420 | |
| O | 5347 | |
| I | 4130 | |
| D | 3825 | 7.8% |
| E | 3126 | 6.4% |
| R | 2883 | 5.9% |
| G | 2535 | 5.2% |
| 2092 | 4.3% | |
| M | 2092 | 4.3% |
| Other values (10) | 9750 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 46154 | |
| Space Separator | 2092 | 4.3% |
| Other Punctuation | 802 | 1.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 6848 | |
| N | 6420 | |
| O | 5347 | |
| I | 4130 | |
| D | 3825 | |
| E | 3126 | |
| R | 2883 | 6.2% |
| G | 2535 | 5.5% |
| M | 2092 | 4.5% |
| T | 1662 | 3.6% |
| Other values (8) | 7286 |
Space Separator
| Value | Count | Frequency (%) |
| 2092 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 802 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 46154 | |
| Common | 2894 | 5.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 6848 | |
| N | 6420 | |
| O | 5347 | |
| I | 4130 | |
| D | 3825 | |
| E | 3126 | |
| R | 2883 | 6.2% |
| G | 2535 | 5.5% |
| M | 2092 | 4.5% |
| T | 1662 | 3.6% |
| Other values (8) | 7286 |
Common
| Value | Count | Frequency (%) |
| 2092 | ||
| . | 802 | 27.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48560 | |
| None | 488 | 1.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 6848 | |
| N | 6420 | |
| O | 5347 | |
| I | 4130 | |
| D | 3825 | |
| E | 3126 | 6.4% |
| R | 2883 | 5.9% |
| G | 2535 | 5.2% |
| 2092 | 4.3% | |
| M | 2092 | 4.3% |
| Other values (9) | 9262 |
None
| Value | Count | Frequency (%) |
| É | 488 |
idade_do_paciente_ao_primeiro_diagnostico
Real number (ℝ)
| Distinct | 76 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 180 |
| Missing (%) | 4.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.247801 |
| Minimum | 22 |
|---|---|
| Maximum | 98 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 22 |
|---|---|
| 5-th percentile | 33 |
| Q1 | 45 |
| median | 54 |
| Q3 | 64 |
| 95-th percentile | 78 |
| Maximum | 98 |
| Range | 76 |
| Interquartile range (IQR) | 19 |
Descriptive statistics
| Standard deviation | 13.574088 |
|---|---|
| Coefficient of variation (CV) | 0.25022375 |
| Kurtosis | -0.39661078 |
| Mean | 54.247801 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.19871813 |
| Sum | 221982 |
| Variance | 184.25586 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50 | 122 | 2.9% |
| 55 | 121 | 2.8% |
| 47 | 121 | 2.8% |
| 57 | 119 | 2.8% |
| 58 | 118 | 2.8% |
| 48 | 116 | 2.7% |
| 53 | 116 | 2.7% |
| 51 | 111 | 2.6% |
| 45 | 109 | 2.6% |
| 56 | 109 | 2.6% |
| Other values (66) | 2930 | |
| (Missing) | 180 | 4.2% |
| Value | Count | Frequency (%) |
| 22 | 3 | 0.1% |
| 23 | 1 | < 0.1% |
| 24 | 9 | 0.2% |
| 25 | 12 | 0.3% |
| 26 | 11 | 0.3% |
| 27 | 14 | |
| 28 | 14 | |
| 29 | 20 | |
| 30 | 32 | |
| 31 | 33 |
| Value | Count | Frequency (%) |
| 98 | 1 | < 0.1% |
| 97 | 2 | < 0.1% |
| 96 | 1 | < 0.1% |
| 95 | 1 | < 0.1% |
| 93 | 2 | < 0.1% |
| 92 | 2 | < 0.1% |
| 91 | 5 | |
| 90 | 4 | |
| 89 | 4 | |
| 88 | 5 |
sexo
Categorical
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 147 |
| Missing (%) | 3.4% |
| Memory size | 33.5 KiB |
| Feminino | |
|---|---|
| Masculino | 35 |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 8.0084848 |
| Min length | 8 |
Characters and Unicode
| Total characters | 33035 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Feminino |
|---|---|
| 2nd row | Feminino |
| 3rd row | Feminino |
| 4th row | Feminino |
| 5th row | Feminino |
Common Values
| Value | Count | Frequency (%) |
| Feminino | 4090 | |
| Masculino | 35 | 0.8% |
| (Missing) | 147 | 3.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| feminino | 4090 | |
| masculino | 35 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 8215 | |
| n | 8215 | |
| o | 4125 | |
| F | 4090 | |
| e | 4090 | |
| m | 4090 | |
| M | 35 | 0.1% |
| a | 35 | 0.1% |
| s | 35 | 0.1% |
| c | 35 | 0.1% |
| Other values (2) | 70 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 28910 | |
| Uppercase Letter | 4125 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 8215 | |
| n | 8215 | |
| o | 4125 | |
| e | 4090 | |
| m | 4090 | |
| a | 35 | 0.1% |
| s | 35 | 0.1% |
| c | 35 | 0.1% |
| u | 35 | 0.1% |
| l | 35 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 4090 | |
| M | 35 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 33035 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 8215 | |
| n | 8215 | |
| o | 4125 | |
| F | 4090 | |
| e | 4090 | |
| m | 4090 | |
| M | 35 | 0.1% |
| a | 35 | 0.1% |
| s | 35 | 0.1% |
| c | 35 | 0.1% |
| Other values (2) | 70 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33035 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 8215 | |
| n | 8215 | |
| o | 4125 | |
| F | 4090 | |
| e | 4090 | |
| m | 4090 | |
| M | 35 | 0.1% |
| a | 35 | 0.1% |
| s | 35 | 0.1% |
| c | 35 | 0.1% |
| Other values (2) | 70 | 0.2% |
raca_declarada_biobanco
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 4038 |
| Missing (%) | 94.5% |
| Memory size | 33.5 KiB |
| Branco | |
|---|---|
| Pardo | |
| Negro | |
| Outro | |
| Asiático | 5 |
Length
| Max length | 8 |
|---|---|
| Median length | 5 |
| Mean length | 5.491453 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1285 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Branco |
|---|---|
| 2nd row | Branco |
| 3rd row | Branco |
| 4th row | Branco |
| 5th row | Pardo |
Common Values
| Value | Count | Frequency (%) |
| Branco | 100 | 2.3% |
| Pardo | 71 | 1.7% |
| Negro | 45 | 1.1% |
| Outro | 13 | 0.3% |
| Asiático | 5 | 0.1% |
| (Missing) | 4038 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| branco | 100 | |
| pardo | 71 | |
| negro | 45 | |
| outro | 13 | 5.6% |
| asiático | 5 | 2.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 234 | |
| r | 229 | |
| a | 171 | |
| c | 105 | |
| B | 100 | |
| n | 100 | |
| P | 71 | 5.5% |
| d | 71 | 5.5% |
| g | 45 | 3.5% |
| e | 45 | 3.5% |
| Other values (8) | 114 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1051 | |
| Uppercase Letter | 234 | 18.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 234 | |
| r | 229 | |
| a | 171 | |
| c | 105 | |
| n | 100 | |
| d | 71 | 6.8% |
| g | 45 | 4.3% |
| e | 45 | 4.3% |
| t | 18 | 1.7% |
| u | 13 | 1.2% |
| Other values (3) | 20 | 1.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 100 | |
| P | 71 | |
| N | 45 | |
| O | 13 | 5.6% |
| A | 5 | 2.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1285 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 234 | |
| r | 229 | |
| a | 171 | |
| c | 105 | |
| B | 100 | |
| n | 100 | |
| P | 71 | 5.5% |
| d | 71 | 5.5% |
| g | 45 | 3.5% |
| e | 45 | 3.5% |
| Other values (8) | 114 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1280 | |
| None | 5 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 234 | |
| r | 229 | |
| a | 171 | |
| c | 105 | |
| B | 100 | |
| n | 100 | |
| P | 71 | 5.5% |
| d | 71 | 5.5% |
| g | 45 | 3.5% |
| e | 45 | 3.5% |
| Other values (7) | 109 |
None
| Value | Count | Frequency (%) |
| á | 5 |
uf_de_nascimento_do_paciente
Categorical
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 4270 |
| Missing (%) | > 99.9% |
| Memory size | 33.5 KiB |
| SP |
|---|
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 4 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SP |
|---|---|
| 2nd row | SP |
Common Values
| Value | Count | Frequency (%) |
| SP | 2 | < 0.1% |
| (Missing) | 4270 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| sp | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 2 | |
| P | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2 | |
| P | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 2 | |
| P | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 2 | |
| P | 2 |
uf_de_residencia_do_paciente
Categorical
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 4270 |
| Missing (%) | > 99.9% |
| Memory size | 33.5 KiB |
| SP |
|---|
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 4 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SP |
|---|---|
| 2nd row | SP |
Common Values
| Value | Count | Frequency (%) |
| SP | 2 | < 0.1% |
| (Missing) | 4270 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| sp | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 2 | |
| P | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2 | |
| P | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 2 | |
| P | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 2 | |
| P | 2 |
data_da_ultima_informacao_sobre_o_paciente
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 2131 |
|---|---|
| Distinct (%) | 49.9% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 33.5 KiB |
| 2020-07-08 | 9 |
|---|---|
| 2020-05-26 | 7 |
| 2020-01-25 | 7 |
| 2019-12-26 | 7 |
| 2020-04-30 | 7 |
| Other values (2126) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 42700 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1033 ? |
|---|---|
| Unique (%) | 24.2% |
Sample
| 1st row | 2014-04-26 |
|---|---|
| 2nd row | 2016-11-17 |
| 3rd row | 2019-05-02 |
| 4th row | 2011-09-29 |
| 5th row | 2017-05-24 |
Common Values
| Value | Count | Frequency (%) |
| 2020-07-08 | 9 | 0.2% |
| 2020-05-26 | 7 | 0.2% |
| 2020-01-25 | 7 | 0.2% |
| 2019-12-26 | 7 | 0.2% |
| 2020-04-30 | 7 | 0.2% |
| 2020-09-17 | 7 | 0.2% |
| 2019-08-06 | 7 | 0.2% |
| 2020-09-18 | 7 | 0.2% |
| 2019-09-18 | 7 | 0.2% |
| 2019-08-18 | 7 | 0.2% |
| Other values (2121) | 4198 |
Length
| Value | Count | Frequency (%) |
| 2020-07-08 | 9 | 0.2% |
| 2019-08-06 | 7 | 0.2% |
| 2020-05-26 | 7 | 0.2% |
| 2019-09-18 | 7 | 0.2% |
| 2020-09-18 | 7 | 0.2% |
| 2019-08-18 | 7 | 0.2% |
| 2020-09-17 | 7 | 0.2% |
| 2020-04-30 | 7 | 0.2% |
| 2019-12-26 | 7 | 0.2% |
| 2020-01-25 | 7 | 0.2% |
| Other values (2121) | 4198 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 10538 | |
| - | 8540 | |
| 2 | 8343 | |
| 1 | 6938 | |
| 9 | 1765 | 4.1% |
| 8 | 1294 | 3.0% |
| 7 | 1289 | 3.0% |
| 3 | 1143 | 2.7% |
| 6 | 1012 | 2.4% |
| 5 | 975 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 34160 | |
| Dash Punctuation | 8540 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 10538 | |
| 2 | 8343 | |
| 1 | 6938 | |
| 9 | 1765 | 5.2% |
| 8 | 1294 | 3.8% |
| 7 | 1289 | 3.8% |
| 3 | 1143 | 3.3% |
| 6 | 1012 | 3.0% |
| 5 | 975 | 2.9% |
| 4 | 863 | 2.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8540 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 42700 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 10538 | |
| - | 8540 | |
| 2 | 8343 | |
| 1 | 6938 | |
| 9 | 1765 | 4.1% |
| 8 | 1294 | 3.0% |
| 7 | 1289 | 3.0% |
| 3 | 1143 | 2.7% |
| 6 | 1012 | 2.4% |
| 5 | 975 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42700 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 10538 | |
| - | 8540 | |
| 2 | 8343 | |
| 1 | 6938 | |
| 9 | 1765 | 4.1% |
| 8 | 1294 | 3.0% |
| 7 | 1289 | 3.0% |
| 3 | 1143 | 2.7% |
| 6 | 1012 | 2.4% |
| 5 | 975 | 2.3% |
ultima_informacao_do_paciente
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 33.5 KiB |
| Vivo, SOE | |
|---|---|
| Obito por câncer | |
| Vivo, com câncer | 235 |
| Óbito por outras causas, SOE | 89 |
Length
| Max length | 28 |
|---|---|
| Median length | 9 |
| Mean length | 11.635363 |
| Min length | 9 |
Characters and Unicode
| Total characters | 49683 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Obito por câncer |
|---|---|
| 2nd row | Vivo, SOE |
| 3rd row | Vivo, SOE |
| 4th row | Obito por câncer |
| 5th row | Vivo, SOE |
Common Values
| Value | Count | Frequency (%) |
| Vivo, SOE | 2815 | |
| Obito por câncer | 1131 | |
| Vivo, com câncer | 235 | 5.5% |
| Óbito por outras causas, SOE | 89 | 2.1% |
| (Missing) | 2 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| vivo | 3050 | |
| soe | 2904 | |
| câncer | 1366 | |
| por | 1220 | 12.0% |
| obito | 1131 | 11.1% |
| com | 235 | 2.3% |
| óbito | 89 | 0.9% |
| outras | 89 | 0.9% |
| causas | 89 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5903 | ||
| o | 5814 | |
| i | 4270 | 8.6% |
| O | 4035 | 8.1% |
| , | 3139 | 6.3% |
| c | 3056 | 6.2% |
| V | 3050 | 6.1% |
| v | 3050 | 6.1% |
| S | 2904 | 5.8% |
| E | 2904 | 5.8% |
| Other values (12) | 11558 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 27659 | |
| Uppercase Letter | 12982 | |
| Space Separator | 5903 | 11.9% |
| Other Punctuation | 3139 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 5814 | |
| i | 4270 | |
| c | 3056 | |
| v | 3050 | |
| r | 2675 | |
| n | 1366 | 4.9% |
| e | 1366 | 4.9% |
| â | 1366 | 4.9% |
| t | 1309 | 4.7% |
| p | 1220 | 4.4% |
| Other values (5) | 2167 | 7.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 4035 | |
| V | 3050 | |
| S | 2904 | |
| E | 2904 | |
| Ó | 89 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| 5903 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3139 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 40641 | |
| Common | 9042 | 18.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 5814 | |
| i | 4270 | |
| O | 4035 | |
| c | 3056 | |
| V | 3050 | |
| v | 3050 | |
| S | 2904 | 7.1% |
| E | 2904 | 7.1% |
| r | 2675 | 6.6% |
| n | 1366 | 3.4% |
| Other values (10) | 7517 |
Common
| Value | Count | Frequency (%) |
| 5903 | ||
| , | 3139 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48228 | |
| None | 1455 | 2.9% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5903 | ||
| o | 5814 | |
| i | 4270 | |
| O | 4035 | 8.4% |
| , | 3139 | 6.5% |
| c | 3056 | 6.3% |
| V | 3050 | 6.3% |
| v | 3050 | 6.3% |
| S | 2904 | 6.0% |
| E | 2904 | 6.0% |
| Other values (10) | 10103 |
None
| Value | Count | Frequency (%) |
| â | 1366 | |
| Ó | 89 | 6.1% |
tempo_de_seguimento_em_dias_desde_o_ultimo_tumor_no_caso_de_tumores_multiplos_dt_pci
Real number (ℝ)
| Distinct | 2071 |
|---|---|
| Distinct (%) | 48.5% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1475.0037 |
| Minimum | 0 |
|---|---|
| Maximum | 4503 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 359 |
| Q1 | 956.25 |
| median | 1282 |
| Q3 | 1817.75 |
| 95-th percentile | 3272.55 |
| Maximum | 4503 |
| Range | 4503 |
| Interquartile range (IQR) | 861.5 |
Descriptive statistics
| Standard deviation | 859.62238 |
|---|---|
| Coefficient of variation (CV) | 0.58279335 |
| Kurtosis | 0.33156452 |
| Mean | 1475.0037 |
| Median Absolute Deviation (MAD) | 397 |
| Skewness | 0.9481852 |
| Sum | 6298266 |
| Variance | 738950.63 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1417 | 14 | 0.3% |
| 1407 | 14 | 0.3% |
| 1408 | 13 | 0.3% |
| 1435 | 12 | 0.3% |
| 1162 | 12 | 0.3% |
| 1379 | 11 | 0.3% |
| 1189 | 11 | 0.3% |
| 1412 | 11 | 0.3% |
| 1406 | 11 | 0.3% |
| 1404 | 10 | 0.2% |
| Other values (2061) | 4151 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 6 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 | |
| 13 | 1 | |
| 16 | 1 | |
| 25 | 1 | |
| 30 | 1 | |
| 31 | 1 |
| Value | Count | Frequency (%) |
| 4503 | 1 | |
| 4474 | 1 | |
| 4395 | 1 | |
| 4381 | 1 | |
| 4330 | 1 | |
| 4326 | 1 | |
| 4295 | 1 | |
| 4277 | 1 | |
| 4235 | 1 | |
| 4231 | 1 |
ja_ficou_gravida
Categorical
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 3259 |
| Missing (%) | 76.3% |
| Memory size | 33.5 KiB |
| Sim | |
|---|---|
| Não | 11 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3039 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Sim |
|---|---|
| 2nd row | Sim |
| 3rd row | Sim |
| 4th row | Sim |
| 5th row | Não |
Common Values
| Value | Count | Frequency (%) |
| Sim | 1002 | 23.5% |
| Não | 11 | 0.3% |
| (Missing) | 3259 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| sim | 1002 | |
| não | 11 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 1002 | |
| i | 1002 | |
| m | 1002 | |
| N | 11 | 0.4% |
| ã | 11 | 0.4% |
| o | 11 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2026 | |
| Uppercase Letter | 1013 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1002 | |
| m | 1002 | |
| ã | 11 | 0.5% |
| o | 11 | 0.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1002 | |
| N | 11 | 1.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3039 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 1002 | |
| i | 1002 | |
| m | 1002 | |
| N | 11 | 0.4% |
| ã | 11 | 0.4% |
| o | 11 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3028 | |
| None | 11 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 1002 | |
| i | 1002 | |
| m | 1002 | |
| N | 11 | 0.4% |
| o | 11 | 0.4% |
None
| Value | Count | Frequency (%) |
| ã | 11 |
quantas_vezes_ficou_gravida
Real number (ℝ)
| Distinct | 6 |
|---|---|
| Distinct (%) | 13.6% |
| Missing | 4228 |
| Missing (%) | 99.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.3181818 |
| Minimum | 1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.4104713 |
|---|---|
| Coefficient of variation (CV) | 0.60843858 |
| Kurtosis | 1.5772173 |
| Mean | 2.3181818 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.2235794 |
| Sum | 102 |
| Variance | 1.9894292 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 16 | 0.4% |
| 2 | 11 | 0.3% |
| 3 | 10 | 0.2% |
| 4 | 3 | 0.1% |
| 5 | 3 | 0.1% |
| 7 | 1 | < 0.1% |
| (Missing) | 4228 |
| Value | Count | Frequency (%) |
| 1 | 16 | |
| 2 | 11 | |
| 3 | 10 | |
| 4 | 3 | 0.1% |
| 5 | 3 | 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7 | 1 | < 0.1% |
| 5 | 3 | 0.1% |
| 4 | 3 | 0.1% |
| 3 | 10 | |
| 2 | 11 | |
| 1 | 16 |
numero_de_partos
Categorical
MISSING  UNIFORM 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4270 |
| Missing (%) | > 99.9% |
| Memory size | 33.5 KiB |
| 2.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 6 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 2.0 | 1 | < 0.1% |
| 1.0 | 1 | < 0.1% |
| (Missing) | 4270 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2.0 | 1 | |
| 1.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 2 | |
| 0 | 2 | |
| 2 | 1 | |
| 1 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4 | |
| Other Punctuation | 2 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 2 | 1 | |
| 1 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 2 | |
| 0 | 2 | |
| 2 | 1 | |
| 1 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 2 | |
| 0 | 2 | |
| 2 | 1 | |
| 1 | 1 |
idade_na_primeira_gestacao
Real number (ℝ)
| Distinct | 36 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 3375 |
| Missing (%) | 79.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.057971 |
| Minimum | 0 |
|---|---|
| Maximum | 53 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 16 |
| Q1 | 19 |
| median | 22 |
| Q3 | 26 |
| 95-th percentile | 33 |
| Maximum | 53 |
| Range | 53 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 5.6652317 |
|---|---|
| Coefficient of variation (CV) | 0.24569515 |
| Kurtosis | 1.7550002 |
| Mean | 23.057971 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.79315429 |
| Sum | 20683 |
| Variance | 32.09485 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19 | 90 | 2.1% |
| 21 | 75 | 1.8% |
| 20 | 75 | 1.8% |
| 23 | 69 | 1.6% |
| 22 | 61 | 1.4% |
| 18 | 61 | 1.4% |
| 24 | 53 | 1.2% |
| 17 | 51 | 1.2% |
| 26 | 44 | 1.0% |
| 25 | 39 | 0.9% |
| Other values (26) | 279 | 6.5% |
| (Missing) | 3375 |
| Value | Count | Frequency (%) |
| 0 | 2 | < 0.1% |
| 11 | 1 | < 0.1% |
| 12 | 2 | < 0.1% |
| 13 | 5 | 0.1% |
| 14 | 8 | 0.2% |
| 15 | 17 | 0.4% |
| 16 | 28 | 0.7% |
| 17 | 51 | |
| 18 | 61 | |
| 19 | 90 |
| Value | Count | Frequency (%) |
| 53 | 1 | < 0.1% |
| 45 | 2 | < 0.1% |
| 44 | 1 | < 0.1% |
| 42 | 2 | < 0.1% |
| 41 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 39 | 2 | < 0.1% |
| 38 | 3 | |
| 37 | 7 | |
| 36 | 5 |
abortou
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 3.8% |
| Missing | 4220 |
| Missing (%) | 98.8% |
| Memory size | 33.5 KiB |
| Não | |
|---|---|
| Sim |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 156 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Não |
|---|---|
| 2nd row | Não |
| 3rd row | Não |
| 4th row | Sim |
| 5th row | Não |
Common Values
| Value | Count | Frequency (%) |
| Não | 41 | 1.0% |
| Sim | 11 | 0.3% |
| (Missing) | 4220 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| não | 41 | |
| sim | 11 | 21.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 41 | |
| ã | 41 | |
| o | 41 | |
| S | 11 | 7.1% |
| i | 11 | 7.1% |
| m | 11 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 104 | |
| Uppercase Letter | 52 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| ã | 41 | |
| o | 41 | |
| i | 11 | 10.6% |
| m | 11 | 10.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 41 | |
| S | 11 | 21.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 156 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 41 | |
| ã | 41 | |
| o | 41 | |
| S | 11 | 7.1% |
| i | 11 | 7.1% |
| m | 11 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 115 | |
| None | 41 | 26.3% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 41 | |
| o | 41 | |
| S | 11 | 9.6% |
| i | 11 | 9.6% |
| m | 11 | 9.6% |
None
| Value | Count | Frequency (%) |
| ã | 41 |
amamentou_na_primeira_gestacao
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 3230 |
| Missing (%) | 75.6% |
| Memory size | 33.5 KiB |
| Sim | |
|---|---|
| Não |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3126 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Sim |
|---|---|
| 2nd row | Sim |
| 3rd row | Não |
| 4th row | Sim |
| 5th row | Sim |
Common Values
| Value | Count | Frequency (%) |
| Sim | 789 | 18.5% |
| Não | 253 | 5.9% |
| (Missing) | 3230 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| sim | 789 | |
| não | 253 | 24.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 789 | |
| i | 789 | |
| m | 789 | |
| N | 253 | 8.1% |
| ã | 253 | 8.1% |
| o | 253 | 8.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2084 | |
| Uppercase Letter | 1042 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 789 | |
| m | 789 | |
| ã | 253 | 12.1% |
| o | 253 | 12.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 789 | |
| N | 253 | 24.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3126 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 789 | |
| i | 789 | |
| m | 789 | |
| N | 253 | 8.1% |
| ã | 253 | 8.1% |
| o | 253 | 8.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2873 | |
| None | 253 | 8.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 789 | |
| i | 789 | |
| m | 789 | |
| N | 253 | 8.8% |
| o | 253 | 8.8% |
None
| Value | Count | Frequency (%) |
| ã | 253 |
por_quanto_tempo_amamentou
Real number (ℝ)
| Distinct | 56 |
|---|---|
| Distinct (%) | 8.1% |
| Missing | 3584 |
| Missing (%) | 83.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.043605 |
| Minimum | 0 |
|---|---|
| Maximum | 260 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 6 |
| median | 12 |
| Q3 | 24 |
| 95-th percentile | 60 |
| Maximum | 260 |
| Range | 260 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 23.10506 |
|---|---|
| Coefficient of variation (CV) | 1.2132714 |
| Kurtosis | 33.704156 |
| Mean | 19.043605 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 4.4078431 |
| Sum | 13102 |
| Variance | 533.8438 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 88 | 2.1% |
| 6 | 79 | 1.8% |
| 24 | 75 | 1.8% |
| 36 | 49 | 1.1% |
| 3 | 46 | 1.1% |
| 4 | 38 | 0.9% |
| 2 | 37 | 0.9% |
| 8 | 25 | 0.6% |
| 1 | 24 | 0.6% |
| 18 | 23 | 0.5% |
| Other values (46) | 204 | 4.8% |
| (Missing) | 3584 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 1 | 24 | 0.6% |
| 2 | 37 | |
| 3 | 46 | |
| 4 | 38 | |
| 5 | 14 | 0.3% |
| 6 | 79 | |
| 7 | 13 | 0.3% |
| 8 | 25 | 0.6% |
| 9 | 11 | 0.3% |
| Value | Count | Frequency (%) |
| 260 | 1 | < 0.1% |
| 240 | 1 | < 0.1% |
| 178 | 1 | < 0.1% |
| 150 | 1 | < 0.1% |
| 120 | 1 | < 0.1% |
| 100 | 1 | < 0.1% |
| 96 | 1 | < 0.1% |
| 84 | 3 | |
| 82 | 1 | < 0.1% |
| 80 | 1 | < 0.1% |
historia_familiar_de_cancer_relacionado_a_sindrome_de_cancer_de_mama_e_ovario_hereditaria_choice_nao
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| Unchecked | |
|---|---|
| Checked | 3 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.9985955 |
| Min length | 7 |
Characters and Unicode
| Total characters | 38442 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unchecked |
|---|---|
| 2nd row | Unchecked |
| 3rd row | Unchecked |
| 4th row | Unchecked |
| 5th row | Unchecked |
Common Values
| Value | Count | Frequency (%) |
| Unchecked | 4269 | |
| Checked | 3 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| unchecked | 4269 | |
| checked | 3 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8541 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4269 | |
| n | 4269 | |
| C | 3 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34170 | |
| Uppercase Letter | 4272 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8541 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| n | 4269 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 4269 | |
| C | 3 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38442 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8541 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4269 | |
| n | 4269 | |
| C | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38442 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8541 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4269 | |
| n | 4269 | |
| C | 3 | < 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| Unchecked | |
|---|---|
| Checked | 40 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.9812734 |
| Min length | 7 |
Characters and Unicode
| Total characters | 38368 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unchecked |
|---|---|
| 2nd row | Unchecked |
| 3rd row | Unchecked |
| 4th row | Unchecked |
| 5th row | Unchecked |
Common Values
| Value | Count | Frequency (%) |
| Unchecked | 4232 | |
| Checked | 40 | 0.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| unchecked | 4232 | |
| checked | 40 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8504 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4232 | |
| n | 4232 | |
| C | 40 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34096 | |
| Uppercase Letter | 4272 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8504 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| n | 4232 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 4232 | |
| C | 40 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38368 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8504 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4232 | |
| n | 4232 | |
| C | 40 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38368 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8504 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4232 | |
| n | 4232 | |
| C | 40 | 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| Unchecked | |
|---|---|
| Checked | 8 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.9962547 |
| Min length | 7 |
Characters and Unicode
| Total characters | 38432 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unchecked |
|---|---|
| 2nd row | Unchecked |
| 3rd row | Unchecked |
| 4th row | Unchecked |
| 5th row | Unchecked |
Common Values
| Value | Count | Frequency (%) |
| Unchecked | 4264 | |
| Checked | 8 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| unchecked | 4264 | |
| checked | 8 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8536 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4264 | |
| n | 4264 | |
| C | 8 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34160 | |
| Uppercase Letter | 4272 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8536 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| n | 4264 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 4264 | |
| C | 8 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38432 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8536 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4264 | |
| n | 4264 | |
| C | 8 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38432 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8536 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4264 | |
| n | 4264 | |
| C | 8 | < 0.1% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| Unchecked |
|---|
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Characters and Unicode
| Total characters | 38448 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unchecked |
|---|---|
| 2nd row | Unchecked |
| 3rd row | Unchecked |
| 4th row | Unchecked |
| 5th row | Unchecked |
Common Values
| Value | Count | Frequency (%) |
| Unchecked | 4272 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| unchecked | 4272 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| U | 4272 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34176 | |
| Uppercase Letter | 4272 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 4272 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38448 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| U | 4272 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38448 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| U | 4272 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| Unchecked |
|---|
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Characters and Unicode
| Total characters | 38448 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unchecked |
|---|---|
| 2nd row | Unchecked |
| 3rd row | Unchecked |
| 4th row | Unchecked |
| 5th row | Unchecked |
Common Values
| Value | Count | Frequency (%) |
| Unchecked | 4272 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| unchecked | 4272 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| U | 4272 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34176 | |
| Uppercase Letter | 4272 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 4272 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38448 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| U | 4272 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38448 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| U | 4272 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
idade_da_primeira_mentruacao
Real number (ℝ)
| Distinct | 17 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 3247 |
| Missing (%) | 76.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.891707 |
| Minimum | 0 |
|---|---|
| Maximum | 37 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 33.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 12 |
| median | 13 |
| Q3 | 14 |
| 95-th percentile | 16 |
| Maximum | 37 |
| Range | 37 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.1044455 |
|---|---|
| Coefficient of variation (CV) | 0.16324025 |
| Kurtosis | 21.516307 |
| Mean | 12.891707 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.9044226 |
| Sum | 13214 |
| Variance | 4.4286909 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 225 | 5.3% |
| 13 | 213 | 5.0% |
| 14 | 162 | 3.8% |
| 11 | 154 | 3.6% |
| 15 | 115 | 2.7% |
| 16 | 43 | 1.0% |
| 10 | 40 | 0.9% |
| 9 | 33 | 0.8% |
| 17 | 23 | 0.5% |
| 18 | 7 | 0.2% |
| Other values (7) | 10 | 0.2% |
| (Missing) | 3247 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 3 | 0.1% |
| 9 | 33 | 0.8% |
| 10 | 40 | 0.9% |
| 11 | 154 | |
| 12 | 225 | |
| 13 | 213 | |
| 14 | 162 | |
| 15 | 115 |
| Value | Count | Frequency (%) |
| 37 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 19 | 2 | < 0.1% |
| 18 | 7 | 0.2% |
| 17 | 23 | 0.5% |
| 16 | 43 | 1.0% |
| 15 | 115 | |
| 14 | 162 | |
| 13 | 213 |
faz_uso_de_metodos_contraceptivo
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 4269 |
| Missing (%) | 99.9% |
| Memory size | 33.5 KiB |
| Não | |
|---|---|
| Sim |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 9 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 33.3% |
Sample
| 1st row | Não |
|---|---|
| 2nd row | Sim |
| 3rd row | Não |
Common Values
| Value | Count | Frequency (%) |
| Não | 2 | < 0.1% |
| Sim | 1 | < 0.1% |
| (Missing) | 4269 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| não | 2 | |
| sim | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 2 | |
| ã | 2 | |
| o | 2 | |
| S | 1 | |
| i | 1 | |
| m | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6 | |
| Uppercase Letter | 3 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| ã | 2 | |
| o | 2 | |
| i | 1 | |
| m | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2 | |
| S | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 2 | |
| ã | 2 | |
| o | 2 | |
| S | 1 | |
| i | 1 | |
| m | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 | |
| None | 2 | 22.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 2 | |
| o | 2 | |
| S | 1 | |
| i | 1 | |
| m | 1 |
None
| Value | Count | Frequency (%) |
| ã | 2 |
qual_metodo_choice_pilula_anticoncepcional
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| Unchecked | |
|---|---|
| Checked | 1 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.9995318 |
| Min length | 7 |
Characters and Unicode
| Total characters | 38446 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unchecked |
|---|---|
| 2nd row | Unchecked |
| 3rd row | Unchecked |
| 4th row | Unchecked |
| 5th row | Unchecked |
Common Values
| Value | Count | Frequency (%) |
| Unchecked | 4271 | |
| Checked | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| unchecked | 4271 | |
| checked | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8543 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4271 | |
| n | 4271 | |
| C | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34174 | |
| Uppercase Letter | 4272 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8543 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| n | 4271 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 4271 | |
| C | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38446 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8543 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4271 | |
| n | 4271 | |
| C | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38446 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8543 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4271 | |
| n | 4271 | |
| C | 1 | < 0.1% |
qual_metodo_choice_diu
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| Unchecked |
|---|
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Characters and Unicode
| Total characters | 38448 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unchecked |
|---|---|
| 2nd row | Unchecked |
| 3rd row | Unchecked |
| 4th row | Unchecked |
| 5th row | Unchecked |
Common Values
| Value | Count | Frequency (%) |
| Unchecked | 4272 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| unchecked | 4272 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| U | 4272 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34176 | |
| Uppercase Letter | 4272 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 4272 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38448 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| U | 4272 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38448 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| U | 4272 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
qual_metodo_choice_camisinha
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| Unchecked |
|---|
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Characters and Unicode
| Total characters | 38448 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unchecked |
|---|---|
| 2nd row | Unchecked |
| 3rd row | Unchecked |
| 4th row | Unchecked |
| 5th row | Unchecked |
Common Values
| Value | Count | Frequency (%) |
| Unchecked | 4272 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| unchecked | 4272 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| U | 4272 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34176 | |
| Uppercase Letter | 4272 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 4272 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38448 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| U | 4272 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38448 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| U | 4272 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
qual_metodo_choice_outros
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| Unchecked |
|---|
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Characters and Unicode
| Total characters | 38448 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unchecked |
|---|---|
| 2nd row | Unchecked |
| 3rd row | Unchecked |
| 4th row | Unchecked |
| 5th row | Unchecked |
Common Values
| Value | Count | Frequency (%) |
| Unchecked | 4272 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| unchecked | 4272 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| U | 4272 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34176 | |
| Uppercase Letter | 4272 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 4272 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38448 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| U | 4272 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38448 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| U | 4272 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
qual_metodo_choice_nao_informou
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| Unchecked |
|---|
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Characters and Unicode
| Total characters | 38448 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unchecked |
|---|---|
| 2nd row | Unchecked |
| 3rd row | Unchecked |
| 4th row | Unchecked |
| 5th row | Unchecked |
Common Values
| Value | Count | Frequency (%) |
| Unchecked | 4272 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| unchecked | 4272 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| U | 4272 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34176 | |
| Uppercase Letter | 4272 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 4272 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38448 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| U | 4272 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38448 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 8544 | |
| e | 8544 | |
| U | 4272 | |
| n | 4272 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 |
ja_fez_uso_de_drogas
Categorical
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 4123 |
| Missing (%) | 96.5% |
| Memory size | 33.5 KiB |
| Não | |
|---|---|
| Sim | 1 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 447 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | Não |
|---|---|
| 2nd row | Não |
| 3rd row | Não |
| 4th row | Não |
| 5th row | Não |
Common Values
| Value | Count | Frequency (%) |
| Não | 148 | 3.5% |
| Sim | 1 | < 0.1% |
| (Missing) | 4123 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| não | 148 | |
| sim | 1 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 148 | |
| ã | 148 | |
| o | 148 | |
| S | 1 | 0.2% |
| i | 1 | 0.2% |
| m | 1 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 298 | |
| Uppercase Letter | 149 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| ã | 148 | |
| o | 148 | |
| i | 1 | 0.3% |
| m | 1 | 0.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 148 | |
| S | 1 | 0.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 447 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 148 | |
| ã | 148 | |
| o | 148 | |
| S | 1 | 0.2% |
| i | 1 | 0.2% |
| m | 1 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 299 | |
| None | 148 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 148 | |
| o | 148 | |
| S | 1 | 0.3% |
| i | 1 | 0.3% |
| m | 1 | 0.3% |
None
| Value | Count | Frequency (%) |
| ã | 148 |
atividade_fisica
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 3967 |
| Missing (%) | 92.9% |
| Memory size | 33.5 KiB |
| Não pratica | |
|---|---|
| Pratica regularmente | |
| Pratica esporadicamente | |
| Pratica frequentemente | 16 |
Length
| Max length | 23 |
|---|---|
| Median length | 11 |
| Mean length | 13.75082 |
| Min length | 11 |
Characters and Unicode
| Total characters | 4194 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Pratica regularmente |
|---|---|
| 2nd row | Não pratica |
| 3rd row | Não pratica |
| 4th row | Não pratica |
| 5th row | Pratica regularmente |
Common Values
| Value | Count | Frequency (%) |
| Não pratica | 223 | 5.2% |
| Pratica regularmente | 43 | 1.0% |
| Pratica esporadicamente | 23 | 0.5% |
| Pratica frequentemente | 16 | 0.4% |
| (Missing) | 3967 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| pratica | 305 | |
| não | 223 | |
| regularmente | 43 | 7.0% |
| esporadicamente | 23 | 3.8% |
| frequentemente | 16 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 699 | |
| r | 430 | |
| t | 403 | |
| i | 328 | |
| c | 328 | |
| 305 | ||
| e | 278 | 6.6% |
| o | 246 | 5.9% |
| p | 246 | 5.9% |
| N | 223 | 5.3% |
| Other values (11) | 708 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3584 | |
| Space Separator | 305 | 7.3% |
| Uppercase Letter | 305 | 7.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 699 | |
| r | 430 | |
| t | 403 | |
| i | 328 | |
| c | 328 | |
| e | 278 | 7.8% |
| o | 246 | 6.9% |
| p | 246 | 6.9% |
| ã | 223 | 6.2% |
| n | 98 | 2.7% |
| Other values (8) | 305 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 223 | |
| P | 82 | 26.9% |
Space Separator
| Value | Count | Frequency (%) |
| 305 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3889 | |
| Common | 305 | 7.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 699 | |
| r | 430 | |
| t | 403 | |
| i | 328 | |
| c | 328 | |
| e | 278 | 7.1% |
| o | 246 | 6.3% |
| p | 246 | 6.3% |
| N | 223 | 5.7% |
| ã | 223 | 5.7% |
| Other values (10) | 485 |
Common
| Value | Count | Frequency (%) |
| 305 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3971 | |
| None | 223 | 5.3% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 699 | |
| r | 430 | |
| t | 403 | |
| i | 328 | |
| c | 328 | |
| 305 | ||
| e | 278 | 7.0% |
| o | 246 | 6.2% |
| p | 246 | 6.2% |
| N | 223 | 5.6% |
| Other values (10) | 485 |
None
| Value | Count | Frequency (%) |
| ã | 223 |
consumo_de_tabaco
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 4060 |
| Missing (%) | 95.0% |
| Memory size | 33.5 KiB |
| Nunca fumou | |
|---|---|
| Fumou no passado | |
| Fuma atualmente | |
| não-informado | 3 |
Length
| Max length | 16 |
|---|---|
| Median length | 11 |
| Mean length | 12.339623 |
| Min length | 11 |
Characters and Unicode
| Total characters | 2616 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Nunca fumou |
|---|---|
| 2nd row | Fumou no passado |
| 3rd row | Nunca fumou |
| 4th row | Fuma atualmente |
| 5th row | Nunca fumou |
Common Values
| Value | Count | Frequency (%) |
| Nunca fumou | 148 | 3.5% |
| Fumou no passado | 34 | 0.8% |
| Fuma atualmente | 27 | 0.6% |
| não-informado | 3 | 0.1% |
| (Missing) | 4060 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| fumou | 182 | |
| nunca | 148 | |
| no | 34 | 7.5% |
| passado | 34 | 7.5% |
| fuma | 27 | 5.9% |
| atualmente | 27 | 5.9% |
| não-informado | 3 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| u | 566 | |
| a | 300 | |
| o | 259 | |
| 243 | ||
| m | 239 | |
| n | 215 | 8.2% |
| f | 151 | 5.8% |
| N | 148 | 5.7% |
| c | 148 | 5.7% |
| s | 68 | 2.6% |
| Other values (10) | 279 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2161 | |
| Space Separator | 243 | 9.3% |
| Uppercase Letter | 209 | 8.0% |
| Dash Punctuation | 3 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 566 | |
| a | 300 | |
| o | 259 | |
| m | 239 | |
| n | 215 | 9.9% |
| f | 151 | 7.0% |
| c | 148 | 6.8% |
| s | 68 | 3.1% |
| e | 54 | 2.5% |
| t | 54 | 2.5% |
| Other values (6) | 107 | 5.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 148 | |
| F | 61 |
Space Separator
| Value | Count | Frequency (%) |
| 243 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2370 | |
| Common | 246 | 9.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 566 | |
| a | 300 | |
| o | 259 | |
| m | 239 | |
| n | 215 | 9.1% |
| f | 151 | 6.4% |
| N | 148 | 6.2% |
| c | 148 | 6.2% |
| s | 68 | 2.9% |
| F | 61 | 2.6% |
| Other values (8) | 215 | 9.1% |
Common
| Value | Count | Frequency (%) |
| 243 | ||
| - | 3 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2613 | |
| None | 3 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| u | 566 | |
| a | 300 | |
| o | 259 | |
| 243 | ||
| m | 239 | |
| n | 215 | 8.2% |
| f | 151 | 5.8% |
| N | 148 | 5.7% |
| c | 148 | 5.7% |
| s | 68 | 2.6% |
| Other values (9) | 276 |
None
| Value | Count | Frequency (%) |
| ã | 3 |
consumo_de_alcool
Categorical
IMBALANCE  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 4068 |
| Missing (%) | 95.2% |
| Memory size | 33.5 KiB |
| Nunca bebeu | |
|---|---|
| Bebia no passado | |
| Bebe atualmente | 6 |
| não-informado | 4 |
Length
| Max length | 16 |
|---|---|
| Median length | 11 |
| Mean length | 12.014706 |
| Min length | 11 |
Characters and Unicode
| Total characters | 2451 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Bebe atualmente |
|---|---|
| 2nd row | Bebia no passado |
| 3rd row | Bebia no passado |
| 4th row | Nunca bebeu |
| 5th row | Bebia no passado |
Common Values
| Value | Count | Frequency (%) |
| Nunca bebeu | 159 | 3.7% |
| Bebia no passado | 35 | 0.8% |
| Bebe atualmente | 6 | 0.1% |
| não-informado | 4 | 0.1% |
| (Missing) | 4068 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| nunca | 159 | |
| bebeu | 159 | |
| bebia | 35 | 8.0% |
| no | 35 | 8.0% |
| passado | 35 | 8.0% |
| bebe | 6 | 1.4% |
| atualmente | 6 | 1.4% |
| não-informado | 4 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 377 | |
| b | 359 | |
| u | 324 | |
| a | 280 | |
| 235 | ||
| n | 208 | |
| N | 159 | |
| c | 159 | |
| o | 82 | 3.3% |
| s | 70 | 2.9% |
| Other values (11) | 198 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2012 | |
| Space Separator | 235 | 9.6% |
| Uppercase Letter | 200 | 8.2% |
| Dash Punctuation | 4 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 377 | |
| b | 359 | |
| u | 324 | |
| a | 280 | |
| n | 208 | |
| c | 159 | |
| o | 82 | 4.1% |
| s | 70 | 3.5% |
| i | 39 | 1.9% |
| d | 39 | 1.9% |
| Other values (7) | 75 | 3.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 159 | |
| B | 41 | 20.5% |
Space Separator
| Value | Count | Frequency (%) |
| 235 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2212 | |
| Common | 239 | 9.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 377 | |
| b | 359 | |
| u | 324 | |
| a | 280 | |
| n | 208 | |
| N | 159 | |
| c | 159 | |
| o | 82 | 3.7% |
| s | 70 | 3.2% |
| B | 41 | 1.9% |
| Other values (9) | 153 |
Common
| Value | Count | Frequency (%) |
| 235 | ||
| - | 4 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2447 | |
| None | 4 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 377 | |
| b | 359 | |
| u | 324 | |
| a | 280 | |
| 235 | ||
| n | 208 | |
| N | 159 | |
| c | 159 | |
| o | 82 | 3.4% |
| s | 70 | 2.9% |
| Other values (10) | 194 |
None
| Value | Count | Frequency (%) |
| ã | 4 |
possui_historico_familiar_de_cancer
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 4082 |
| Missing (%) | 95.6% |
| Memory size | 33.5 KiB |
| Sim | |
|---|---|
| Não |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 570 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Sim |
|---|---|
| 2nd row | Não |
| 3rd row | Não |
| 4th row | Sim |
| 5th row | Não |
Common Values
| Value | Count | Frequency (%) |
| Sim | 138 | 3.2% |
| Não | 52 | 1.2% |
| (Missing) | 4082 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| sim | 138 | |
| não | 52 | 27.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 138 | |
| i | 138 | |
| m | 138 | |
| N | 52 | 9.1% |
| ã | 52 | 9.1% |
| o | 52 | 9.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 380 | |
| Uppercase Letter | 190 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 138 | |
| m | 138 | |
| ã | 52 | 13.7% |
| o | 52 | 13.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 138 | |
| N | 52 | 27.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 570 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 138 | |
| i | 138 | |
| m | 138 | |
| N | 52 | 9.1% |
| ã | 52 | 9.1% |
| o | 52 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 518 | |
| None | 52 | 9.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 138 | |
| i | 138 | |
| m | 138 | |
| N | 52 | 10.0% |
| o | 52 | 10.0% |
None
| Value | Count | Frequency (%) |
| ã | 52 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| Unchecked | |
|---|---|
| Checked | 89 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.9583333 |
| Min length | 7 |
Characters and Unicode
| Total characters | 38270 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unchecked |
|---|---|
| 2nd row | Unchecked |
| 3rd row | Unchecked |
| 4th row | Unchecked |
| 5th row | Unchecked |
Common Values
| Value | Count | Frequency (%) |
| Unchecked | 4183 | |
| Checked | 89 | 2.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| unchecked | 4183 | |
| checked | 89 | 2.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8455 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4183 | |
| n | 4183 | |
| C | 89 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 33998 | |
| Uppercase Letter | 4272 | 11.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8455 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| n | 4183 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 4183 | |
| C | 89 | 2.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38270 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8455 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4183 | |
| n | 4183 | |
| C | 89 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38270 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8455 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4183 | |
| n | 4183 | |
| C | 89 | 0.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| Unchecked | |
|---|---|
| Checked | 70 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.9672285 |
| Min length | 7 |
Characters and Unicode
| Total characters | 38308 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unchecked |
|---|---|
| 2nd row | Unchecked |
| 3rd row | Unchecked |
| 4th row | Unchecked |
| 5th row | Unchecked |
Common Values
| Value | Count | Frequency (%) |
| Unchecked | 4202 | |
| Checked | 70 | 1.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| unchecked | 4202 | |
| checked | 70 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8474 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4202 | |
| n | 4202 | |
| C | 70 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34036 | |
| Uppercase Letter | 4272 | 11.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8474 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| n | 4202 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 4202 | |
| C | 70 | 1.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38308 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8474 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4202 | |
| n | 4202 | |
| C | 70 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38308 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8474 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4202 | |
| n | 4202 | |
| C | 70 | 0.2% |
grau_de_parentesco_de_familiar_com_cancer_choice_terceiro_bisavos_tio_avos_primos_sobrinhos
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 33.5 KiB |
| Unchecked | |
|---|---|
| Checked | 48 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.9775281 |
| Min length | 7 |
Characters and Unicode
| Total characters | 38352 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unchecked |
|---|---|
| 2nd row | Unchecked |
| 3rd row | Unchecked |
| 4th row | Unchecked |
| 5th row | Unchecked |
Common Values
| Value | Count | Frequency (%) |
| Unchecked | 4224 | |
| Checked | 48 | 1.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| unchecked | 4224 | |
| checked | 48 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8496 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4224 | |
| n | 4224 | |
| C | 48 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34080 | |
| Uppercase Letter | 4272 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8496 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| n | 4224 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 4224 | |
| C | 48 | 1.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38352 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8496 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4224 | |
| n | 4224 | |
| C | 48 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38352 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 8544 | |
| c | 8496 | |
| h | 4272 | |
| k | 4272 | |
| d | 4272 | |
| U | 4224 | |
| n | 4224 | |
| C | 48 | 0.1% |
regime_de_tratamento
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1409 |
| Missing (%) | 33.0% |
| Memory size | 33.5 KiB |
| Terapia Adjuvante | |
|---|---|
| Terapia Neoadjuvante | |
| Paliativo | 70 |
| Não fez quimioterapia | 25 |
Length
| Max length | 21 |
|---|---|
| Median length | 20 |
| Mean length | 18.249738 |
| Min length | 9 |
Characters and Unicode
| Total characters | 52249 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Terapia Adjuvante |
|---|---|
| 2nd row | Terapia Adjuvante |
| 3rd row | Terapia Neoadjuvante |
| 4th row | Terapia Adjuvante |
| 5th row | Terapia Neoadjuvante |
Common Values
| Value | Count | Frequency (%) |
| Terapia Adjuvante | 1422 | |
| Terapia Neoadjuvante | 1346 | |
| Paliativo | 70 | 1.6% |
| Não fez quimioterapia | 25 | 0.6% |
| (Missing) | 1409 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| terapia | 2768 | |
| adjuvante | 1422 | |
| neoadjuvante | 1346 | |
| paliativo | 70 | 1.2% |
| não | 25 | 0.4% |
| fez | 25 | 0.4% |
| quimioterapia | 25 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 9840 | |
| e | 6932 | |
| i | 2983 | 5.7% |
| t | 2863 | 5.5% |
| v | 2838 | 5.4% |
| 2818 | 5.4% | |
| u | 2793 | 5.3% |
| r | 2793 | 5.3% |
| p | 2793 | 5.3% |
| n | 2768 | 5.3% |
| Other values (13) | 12828 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 43800 | |
| Uppercase Letter | 5631 | 10.8% |
| Space Separator | 2818 | 5.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 9840 | |
| e | 6932 | |
| i | 2983 | 6.8% |
| t | 2863 | 6.5% |
| v | 2838 | 6.5% |
| u | 2793 | 6.4% |
| r | 2793 | 6.4% |
| p | 2793 | 6.4% |
| n | 2768 | 6.3% |
| j | 2768 | 6.3% |
| Other values (8) | 4429 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 2768 | |
| A | 1422 | |
| N | 1371 | |
| P | 70 | 1.2% |
Space Separator
| Value | Count | Frequency (%) |
| 2818 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 49431 | |
| Common | 2818 | 5.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 9840 | |
| e | 6932 | |
| i | 2983 | 6.0% |
| t | 2863 | 5.8% |
| v | 2838 | 5.7% |
| u | 2793 | 5.7% |
| r | 2793 | 5.7% |
| p | 2793 | 5.7% |
| n | 2768 | 5.6% |
| T | 2768 | 5.6% |
| Other values (12) | 10060 |
Common
| Value | Count | Frequency (%) |
| 2818 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 52224 | |
| None | 25 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 9840 | |
| e | 6932 | |
| i | 2983 | 5.7% |
| t | 2863 | 5.5% |
| v | 2838 | 5.4% |
| 2818 | 5.4% | |
| u | 2793 | 5.3% |
| r | 2793 | 5.3% |
| p | 2793 | 5.3% |
| n | 2768 | 5.3% |
| Other values (12) | 12803 |
None
| Value | Count | Frequency (%) |
| ã | 25 |
hormonioterapia
Categorical
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | 33.3% |
| Missing | 4269 |
| Missing (%) | 99.9% |
| Memory size | 33.5 KiB |
| Adjuvante |
|---|
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Characters and Unicode
| Total characters | 27 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Adjuvante |
|---|---|
| 2nd row | Adjuvante |
| 3rd row | Adjuvante |
Common Values
| Value | Count | Frequency (%) |
| Adjuvante | 3 | 0.1% |
| (Missing) | 4269 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| adjuvante | 3 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 3 | |
| d | 3 | |
| j | 3 | |
| u | 3 | |
| v | 3 | |
| a | 3 | |
| n | 3 | |
| t | 3 | |
| e | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 24 | |
| Uppercase Letter | 3 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 3 | |
| j | 3 | |
| u | 3 | |
| v | 3 | |
| a | 3 | |
| n | 3 | |
| t | 3 | |
| e | 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 27 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 3 | |
| d | 3 | |
| j | 3 | |
| u | 3 | |
| v | 3 | |
| a | 3 | |
| n | 3 | |
| t | 3 | |
| e | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 3 | |
| d | 3 | |
| j | 3 | |
| u | 3 | |
| v | 3 | |
| a | 3 | |
| n | 3 | |
| t | 3 | |
| e | 3 |
data_da_cirurgia
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 1653 |
|---|---|
| Distinct (%) | 74.6% |
| Missing | 2056 |
| Missing (%) | 48.1% |
| Memory size | 33.5 KiB |
| 2016-02-18 | 5 |
|---|---|
| 2011-09-05 | 5 |
| 2013-08-25 | 4 |
| 2011-06-08 | 4 |
| 2012-08-07 | 4 |
| Other values (1648) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 22160 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1206 ? |
|---|---|
| Unique (%) | 54.4% |
Sample
| 1st row | 2009-09-04 |
|---|---|
| 2nd row | 2011-07-05 |
| 3rd row | 2011-05-21 |
| 4th row | 2010-10-05 |
| 5th row | 2009-05-07 |
Common Values
| Value | Count | Frequency (%) |
| 2016-02-18 | 5 | 0.1% |
| 2011-09-05 | 5 | 0.1% |
| 2013-08-25 | 4 | 0.1% |
| 2011-06-08 | 4 | 0.1% |
| 2012-08-07 | 4 | 0.1% |
| 2012-07-12 | 4 | 0.1% |
| 2018-05-05 | 4 | 0.1% |
| 2013-01-08 | 4 | 0.1% |
| 2013-05-09 | 4 | 0.1% |
| 2012-11-20 | 4 | 0.1% |
| Other values (1643) | 2174 | |
| (Missing) | 2056 |
Length
| Value | Count | Frequency (%) |
| 2016-02-18 | 5 | 0.2% |
| 2011-09-05 | 5 | 0.2% |
| 2013-08-25 | 4 | 0.2% |
| 2016-10-13 | 4 | 0.2% |
| 2011-06-08 | 4 | 0.2% |
| 2011-05-21 | 4 | 0.2% |
| 2017-06-03 | 4 | 0.2% |
| 2012-02-19 | 4 | 0.2% |
| 2016-03-04 | 4 | 0.2% |
| 2014-05-28 | 4 | 0.2% |
| Other values (1643) | 2174 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5085 | |
| - | 4432 | |
| 1 | 4216 | |
| 2 | 3949 | |
| 3 | 753 | 3.4% |
| 6 | 734 | 3.3% |
| 5 | 702 | 3.2% |
| 7 | 666 | 3.0% |
| 8 | 573 | 2.6% |
| 4 | 563 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 17728 | |
| Dash Punctuation | 4432 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5085 | |
| 1 | 4216 | |
| 2 | 3949 | |
| 3 | 753 | 4.2% |
| 6 | 734 | 4.1% |
| 5 | 702 | 4.0% |
| 7 | 666 | 3.8% |
| 8 | 573 | 3.2% |
| 4 | 563 | 3.2% |
| 9 | 487 | 2.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4432 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 22160 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5085 | |
| - | 4432 | |
| 1 | 4216 | |
| 2 | 3949 | |
| 3 | 753 | 3.4% |
| 6 | 734 | 3.3% |
| 5 | 702 | 3.2% |
| 7 | 666 | 3.0% |
| 8 | 573 | 2.6% |
| 4 | 563 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22160 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5085 | |
| - | 4432 | |
| 1 | 4216 | |
| 2 | 3949 | |
| 3 | 753 | 3.4% |
| 6 | 734 | 3.3% |
| 5 | 702 | 3.2% |
| 7 | 666 | 3.0% |
| 8 | 573 | 2.6% |
| 4 | 563 | 2.5% |
tipo_de_terapia_anti_her2_neoadjuvante
Categorical
IMBALANCE  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 3138 |
| Missing (%) | 73.5% |
| Memory size | 33.5 KiB |
| Trastuzumabe | |
|---|---|
| Trastuzumabe + Pertuzumabe | 4 |
Length
| Max length | 26 |
|---|---|
| Median length | 12 |
| Mean length | 12.049383 |
| Min length | 12 |
Characters and Unicode
| Total characters | 13664 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Trastuzumabe |
|---|---|
| 2nd row | Trastuzumabe |
| 3rd row | Trastuzumabe |
| 4th row | Trastuzumabe |
| 5th row | Trastuzumabe |
Common Values
| Value | Count | Frequency (%) |
| Trastuzumabe | 1130 | 26.5% |
| Trastuzumabe + Pertuzumabe | 4 | 0.1% |
| (Missing) | 3138 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| trastuzumabe | 1134 | |
| 4 | 0.4% | |
| pertuzumabe | 4 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| u | 2276 | |
| a | 2272 | |
| e | 1142 | |
| r | 1138 | |
| t | 1138 | |
| z | 1138 | |
| m | 1138 | |
| b | 1138 | |
| T | 1134 | |
| s | 1134 | |
| Other values (3) | 16 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12514 | |
| Uppercase Letter | 1138 | 8.3% |
| Space Separator | 8 | 0.1% |
| Math Symbol | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 2276 | |
| a | 2272 | |
| e | 1142 | |
| r | 1138 | |
| t | 1138 | |
| z | 1138 | |
| m | 1138 | |
| b | 1138 | |
| s | 1134 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1134 | |
| P | 4 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| 8 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13652 | |
| Common | 12 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 2276 | |
| a | 2272 | |
| e | 1142 | |
| r | 1138 | |
| t | 1138 | |
| z | 1138 | |
| m | 1138 | |
| b | 1138 | |
| T | 1134 | |
| s | 1134 |
Common
| Value | Count | Frequency (%) |
| 8 | ||
| + | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13664 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| u | 2276 | |
| a | 2272 | |
| e | 1142 | |
| r | 1138 | |
| t | 1138 | |
| z | 1138 | |
| m | 1138 | |
| b | 1138 | |
| T | 1134 | |
| s | 1134 | |
| Other values (3) | 16 | 0.1% |
radioterapia
Categorical
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1947 |
| Missing (%) | 45.6% |
| Memory size | 33.5 KiB |
| Sim |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 6975 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Sim |
|---|---|
| 2nd row | Sim |
| 3rd row | Sim |
| 4th row | Sim |
| 5th row | Sim |
Common Values
| Value | Count | Frequency (%) |
| Sim | 2325 | |
| (Missing) | 1947 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| sim | 2325 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 2325 | |
| i | 2325 | |
| m | 2325 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4650 | |
| Uppercase Letter | 2325 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 2325 | |
| m | 2325 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2325 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6975 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 2325 | |
| i | 2325 | |
| m | 2325 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6975 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 2325 | |
| i | 2325 | |
| m | 2325 |
data_de_inicio_do_tratamento_quimioterapia
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 1766 |
|---|---|
| Distinct (%) | 62.6% |
| Missing | 1450 |
| Missing (%) | 33.9% |
| Memory size | 33.5 KiB |
| 2016-01-04 | 7 |
|---|---|
| 2017-09-04 | 6 |
| 2013-08-01 | 6 |
| 2011-11-25 | 6 |
| 2012-01-12 | 5 |
| Other values (1761) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 28220 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1021 ? |
|---|---|
| Unique (%) | 36.2% |
Sample
| 1st row | 2014-08-24 |
|---|---|
| 2nd row | 2011-09-08 |
| 3rd row | 2099-01-30 |
| 4th row | 2020-09-04 |
| 5th row | 2016-08-24 |
Common Values
| Value | Count | Frequency (%) |
| 2016-01-04 | 7 | 0.2% |
| 2017-09-04 | 6 | 0.1% |
| 2013-08-01 | 6 | 0.1% |
| 2011-11-25 | 6 | 0.1% |
| 2012-01-12 | 5 | 0.1% |
| 2011-11-27 | 5 | 0.1% |
| 2013-11-14 | 5 | 0.1% |
| 2016-09-02 | 5 | 0.1% |
| 2017-05-29 | 5 | 0.1% |
| 2016-03-15 | 5 | 0.1% |
| Other values (1756) | 2767 | |
| (Missing) | 1450 |
Length
| Value | Count | Frequency (%) |
| 2016-01-04 | 7 | 0.2% |
| 2013-08-01 | 6 | 0.2% |
| 2011-11-25 | 6 | 0.2% |
| 2017-09-04 | 6 | 0.2% |
| 2017-05-29 | 5 | 0.2% |
| 2015-06-08 | 5 | 0.2% |
| 2016-03-15 | 5 | 0.2% |
| 2017-07-25 | 5 | 0.2% |
| 2016-09-02 | 5 | 0.2% |
| 2013-11-14 | 5 | 0.2% |
| Other values (1756) | 2767 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6291 | |
| - | 5644 | |
| 1 | 5547 | |
| 2 | 4977 | |
| 3 | 969 | 3.4% |
| 7 | 950 | 3.4% |
| 6 | 935 | 3.3% |
| 5 | 896 | 3.2% |
| 4 | 774 | 2.7% |
| 8 | 721 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 22576 | |
| Dash Punctuation | 5644 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6291 | |
| 1 | 5547 | |
| 2 | 4977 | |
| 3 | 969 | 4.3% |
| 7 | 950 | 4.2% |
| 6 | 935 | 4.1% |
| 5 | 896 | 4.0% |
| 4 | 774 | 3.4% |
| 8 | 721 | 3.2% |
| 9 | 516 | 2.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5644 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 28220 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 6291 | |
| - | 5644 | |
| 1 | 5547 | |
| 2 | 4977 | |
| 3 | 969 | 3.4% |
| 7 | 950 | 3.4% |
| 6 | 935 | 3.3% |
| 5 | 896 | 3.2% |
| 4 | 774 | 2.7% |
| 8 | 721 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28220 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 6291 | |
| - | 5644 | |
| 1 | 5547 | |
| 2 | 4977 | |
| 3 | 969 | 3.4% |
| 7 | 950 | 3.4% |
| 6 | 935 | 3.3% |
| 5 | 896 | 3.2% |
| 4 | 774 | 2.7% |
| 8 | 721 | 2.6% |
esquema_de_hormonioterapia
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 25.0% |
| Missing | 4260 |
| Missing (%) | 99.7% |
| Memory size | 33.5 KiB |
| Inibidor de aromatase isolado | |
|---|---|
| Switch: tamoxifeno seguido de IA | |
| Tamoxifeno isolado |
Length
| Max length | 32 |
|---|---|
| Median length | 29 |
| Mean length | 27.25 |
| Min length | 18 |
Characters and Unicode
| Total characters | 327 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Inibidor de aromatase isolado |
|---|---|
| 2nd row | Switch: tamoxifeno seguido de IA |
| 3rd row | Tamoxifeno isolado |
| 4th row | Switch: tamoxifeno seguido de IA |
| 5th row | Tamoxifeno isolado |
Common Values
| Value | Count | Frequency (%) |
| Inibidor de aromatase isolado | 5 | 0.1% |
| Switch: tamoxifeno seguido de IA | 4 | 0.1% |
| Tamoxifeno isolado | 3 | 0.1% |
| (Missing) | 4260 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| de | 9 | |
| isolado | 8 | |
| tamoxifeno | 7 | |
| inibidor | 5 | |
| aromatase | 5 | |
| switch | 4 | |
| seguido | 4 | |
| ia | 4 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 44 | |
| 34 | ||
| i | 33 | |
| a | 30 | 9.2% |
| d | 26 | 8.0% |
| e | 25 | 7.6% |
| s | 17 | 5.2% |
| t | 13 | 4.0% |
| n | 12 | 3.7% |
| m | 12 | 3.7% |
| Other values (15) | 81 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 269 | |
| Space Separator | 34 | 10.4% |
| Uppercase Letter | 20 | 6.1% |
| Other Punctuation | 4 | 1.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 44 | |
| i | 33 | |
| a | 30 | |
| d | 26 | |
| e | 25 | |
| s | 17 | 6.3% |
| t | 13 | 4.8% |
| n | 12 | 4.5% |
| m | 12 | 4.5% |
| r | 10 | 3.7% |
| Other values (9) | 47 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 9 | |
| S | 4 | |
| A | 4 | |
| T | 3 | 15.0% |
Space Separator
| Value | Count | Frequency (%) |
| 34 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 289 | |
| Common | 38 | 11.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 44 | |
| i | 33 | |
| a | 30 | |
| d | 26 | 9.0% |
| e | 25 | 8.7% |
| s | 17 | 5.9% |
| t | 13 | 4.5% |
| n | 12 | 4.2% |
| m | 12 | 4.2% |
| r | 10 | 3.5% |
| Other values (13) | 67 |
Common
| Value | Count | Frequency (%) |
| 34 | ||
| : | 4 | 10.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 327 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 44 | |
| 34 | ||
| i | 33 | |
| a | 30 | 9.2% |
| d | 26 | 8.0% |
| e | 25 | 7.6% |
| s | 17 | 5.2% |
| t | 13 | 4.0% |
| n | 12 | 3.7% |
| m | 12 | 3.7% |
| Other values (15) | 81 |
data_do_inicio_hormonioterapia_adjuvante
Categorical
MISSING  UNIFORM 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4270 |
| Missing (%) | > 99.9% |
| Memory size | 33.5 KiB |
| 2013-01-07 | |
|---|---|
| 2021-06-24 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 20 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2013-01-07 |
|---|---|
| 2nd row | 2021-06-24 |
Common Values
| Value | Count | Frequency (%) |
| 2013-01-07 | 1 | < 0.1% |
| 2021-06-24 | 1 | < 0.1% |
| (Missing) | 4270 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2013-01-07 | 1 | |
| 2021-06-24 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 2 | 4 | |
| - | 4 | |
| 1 | 3 | |
| 3 | 1 | 5.0% |
| 7 | 1 | 5.0% |
| 6 | 1 | 5.0% |
| 4 | 1 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16 | |
| Dash Punctuation | 4 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 2 | 4 | |
| 1 | 3 | |
| 3 | 1 | 6.2% |
| 7 | 1 | 6.2% |
| 6 | 1 | 6.2% |
| 4 | 1 | 6.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 2 | 4 | |
| - | 4 | |
| 1 | 3 | |
| 3 | 1 | 5.0% |
| 7 | 1 | 5.0% |
| 6 | 1 | 5.0% |
| 4 | 1 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 2 | 4 | |
| - | 4 | |
| 1 | 3 | |
| 3 | 1 | 5.0% |
| 7 | 1 | 5.0% |
| 6 | 1 | 5.0% |
| 4 | 1 | 5.0% |
data_de_inicio_da_radioterapia
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 1708 |
|---|---|
| Distinct (%) | 73.5% |
| Missing | 1949 |
| Missing (%) | 45.6% |
| Memory size | 33.5 KiB |
| 2016-06-03 | 5 |
|---|---|
| 2016-05-05 | 5 |
| 2013-08-04 | 5 |
| 2014-07-20 | 4 |
| 2016-11-06 | 4 |
| Other values (1703) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 23230 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1235 ? |
|---|---|
| Unique (%) | 53.2% |
Sample
| 1st row | 2011-11-23 |
|---|---|
| 2nd row | 2010-04-27 |
| 3rd row | 2011-10-11 |
| 4th row | 2010-12-16 |
| 5th row | 2011-07-27 |
Common Values
| Value | Count | Frequency (%) |
| 2016-06-03 | 5 | 0.1% |
| 2016-05-05 | 5 | 0.1% |
| 2013-08-04 | 5 | 0.1% |
| 2014-07-20 | 4 | 0.1% |
| 2016-11-06 | 4 | 0.1% |
| 2018-09-27 | 4 | 0.1% |
| 2018-09-21 | 4 | 0.1% |
| 2012-05-26 | 4 | 0.1% |
| 2012-03-01 | 4 | 0.1% |
| 2013-01-02 | 4 | 0.1% |
| Other values (1698) | 2280 | |
| (Missing) | 1949 |
Length
| Value | Count | Frequency (%) |
| 2016-06-03 | 5 | 0.2% |
| 2013-08-04 | 5 | 0.2% |
| 2016-05-05 | 5 | 0.2% |
| 2016-09-20 | 4 | 0.2% |
| 2013-10-19 | 4 | 0.2% |
| 2016-12-15 | 4 | 0.2% |
| 2013-06-17 | 4 | 0.2% |
| 2016-05-24 | 4 | 0.2% |
| 2016-05-15 | 4 | 0.2% |
| 2012-04-26 | 4 | 0.2% |
| Other values (1698) | 2280 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5286 | |
| - | 4646 | |
| 1 | 4383 | |
| 2 | 4158 | |
| 3 | 792 | 3.4% |
| 6 | 754 | 3.2% |
| 5 | 720 | 3.1% |
| 8 | 711 | 3.1% |
| 7 | 701 | 3.0% |
| 4 | 599 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 18584 | |
| Dash Punctuation | 4646 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5286 | |
| 1 | 4383 | |
| 2 | 4158 | |
| 3 | 792 | 4.3% |
| 6 | 754 | 4.1% |
| 5 | 720 | 3.9% |
| 8 | 711 | 3.8% |
| 7 | 701 | 3.8% |
| 4 | 599 | 3.2% |
| 9 | 480 | 2.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4646 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23230 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5286 | |
| - | 4646 | |
| 1 | 4383 | |
| 2 | 4158 | |
| 3 | 792 | 3.4% |
| 6 | 754 | 3.2% |
| 5 | 720 | 3.1% |
| 8 | 711 | 3.1% |
| 7 | 701 | 3.0% |
| 4 | 599 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23230 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5286 | |
| - | 4646 | |
| 1 | 4383 | |
| 2 | 4158 | |
| 3 | 792 | 3.4% |
| 6 | 754 | 3.2% |
| 5 | 720 | 3.1% |
| 8 | 711 | 3.1% |
| 7 | 701 | 3.0% |
| 4 | 599 | 2.6% |
| record_id | repeat_instrument | repeat_instance | escolaridade | idade_do_paciente_ao_primeiro_diagnostico | sexo | raca_declarada_biobanco | uf_de_nascimento_do_paciente | uf_de_residencia_do_paciente | data_da_ultima_informacao_sobre_o_paciente | ultima_informacao_do_paciente | tempo_de_seguimento_em_dias_desde_o_ultimo_tumor_no_caso_de_tumores_multiplos_dt_pci | ja_ficou_gravida | quantas_vezes_ficou_gravida | numero_de_partos | idade_na_primeira_gestacao | abortou | amamentou_na_primeira_gestacao | por_quanto_tempo_amamentou | historia_familiar_de_cancer_relacionado_a_sindrome_de_cancer_de_mama_e_ovario_hereditaria_choice_nao | historia_familiar_de_cancer_relacionado_a_sindrome_de_cancer_de_mama_e_ovario_hereditaria_choice_sim_1o_grau_apenas_1_caso | historia_familiar_de_cancer_relacionado_a_sindrome_de_cancer_de_mama_e_ovario_hereditaria_choice_sim_1o_grau_mais_de_1_caso | historia_familiar_de_cancer_relacionado_a_sindrome_de_cancer_de_mama_e_ovario_hereditaria_choice_sim_2o_grau_apenas_1_caso | historia_familiar_de_cancer_relacionado_a_sindrome_de_cancer_de_mama_e_ovario_hereditaria_choice_sim_2o_grau_mais_de_1_caso | idade_da_primeira_mentruacao | faz_uso_de_metodos_contraceptivo | qual_metodo_choice_pilula_anticoncepcional | qual_metodo_choice_diu | qual_metodo_choice_camisinha | qual_metodo_choice_outros | qual_metodo_choice_nao_informou | ja_fez_uso_de_drogas | atividade_fisica | consumo_de_tabaco | consumo_de_alcool | possui_historico_familiar_de_cancer | grau_de_parentesco_de_familiar_com_cancer_choice_primeiro_pais_irmaos_filhos | grau_de_parentesco_de_familiar_com_cancer_choice_segundo_avos_tios_e_netos | grau_de_parentesco_de_familiar_com_cancer_choice_terceiro_bisavos_tio_avos_primos_sobrinhos | regime_de_tratamento | hormonioterapia | data_da_cirurgia | tipo_de_terapia_anti_her2_neoadjuvante | radioterapia | data_de_inicio_do_tratamento_quimioterapia | esquema_de_hormonioterapia | data_do_inicio_hormonioterapia_adjuvante | data_de_inicio_da_radioterapia | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 302 | NaN | NaN | ENS. FUNDAMENTAL INCOMPLETO | 51.0 | Feminino | NaN | NaN | NaN | 2014-04-26 | Obito por câncer | 2225.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | Trastuzumabe | NaN | NaN | Inibidor de aromatase isolado | NaN | NaN |
| 1 | 710 | NaN | NaN | ENSINO MÉDIO | 58.0 | Feminino | NaN | NaN | NaN | 2016-11-17 | Vivo, SOE | 3294.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Terapia Adjuvante | NaN | 2009-09-04 | NaN | NaN | 2014-08-24 | NaN | NaN | NaN |
| 2 | 752 | NaN | NaN | ENS. FUNDAMENTAL INCOMPLETO | 56.0 | Feminino | NaN | NaN | NaN | 2019-05-02 | Vivo, SOE | 4153.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 3 | 1367 | NaN | NaN | ENS. FUNDAMENTAL INCOMPLETO | 63.0 | Feminino | NaN | NaN | NaN | 2011-09-29 | Obito por câncer | 1331.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | NaN | NaN | 2011-07-05 | NaN | NaN | NaN | NaN | NaN | NaN |
| 4 | 1589 | NaN | NaN | ENS. FUNDAMENTAL COMPLETO | 42.0 | Feminino | NaN | NaN | NaN | 2017-05-24 | Vivo, SOE | 3290.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 5 | 1705 | NaN | NaN | ENS. FUNDAMENTAL INCOMPLETO | 43.0 | Feminino | NaN | NaN | NaN | 2013-06-11 | Obito por câncer | 2224.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Terapia Adjuvante | NaN | 2011-05-21 | NaN | Sim | 2011-09-08 | NaN | NaN | 2011-11-23 |
| 6 | 1843 | NaN | NaN | IGNORADA | 52.0 | Feminino | NaN | NaN | NaN | 2009-01-25 | Vivo, com câncer | 182.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | NaN | NaN | 2010-10-05 | Trastuzumabe | NaN | NaN | NaN | NaN | NaN |
| 7 | 1873 | NaN | NaN | IGNORADA | 40.0 | Feminino | NaN | NaN | NaN | 2017-07-08 | Vivo, SOE | 3234.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | Trastuzumabe | NaN | NaN | Switch: tamoxifeno seguido de IA | NaN | NaN |
| 8 | 1898 | NaN | NaN | ENS. FUNDAMENTAL COMPLETO | 60.0 | Feminino | NaN | NaN | NaN | 2009-08-22 | Obito por câncer | 428.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9 | 1960 | NaN | NaN | ENS. FUNDAMENTAL INCOMPLETO | 29.0 | Feminino | NaN | NaN | NaN | 2010-06-27 | Vivo, SOE | 699.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Terapia Neoadjuvante | NaN | NaN | Trastuzumabe | NaN | 2099-01-30 | NaN | NaN | NaN |
| record_id | repeat_instrument | repeat_instance | escolaridade | idade_do_paciente_ao_primeiro_diagnostico | sexo | raca_declarada_biobanco | uf_de_nascimento_do_paciente | uf_de_residencia_do_paciente | data_da_ultima_informacao_sobre_o_paciente | ultima_informacao_do_paciente | tempo_de_seguimento_em_dias_desde_o_ultimo_tumor_no_caso_de_tumores_multiplos_dt_pci | ja_ficou_gravida | quantas_vezes_ficou_gravida | numero_de_partos | idade_na_primeira_gestacao | abortou | amamentou_na_primeira_gestacao | por_quanto_tempo_amamentou | historia_familiar_de_cancer_relacionado_a_sindrome_de_cancer_de_mama_e_ovario_hereditaria_choice_nao | historia_familiar_de_cancer_relacionado_a_sindrome_de_cancer_de_mama_e_ovario_hereditaria_choice_sim_1o_grau_apenas_1_caso | historia_familiar_de_cancer_relacionado_a_sindrome_de_cancer_de_mama_e_ovario_hereditaria_choice_sim_1o_grau_mais_de_1_caso | historia_familiar_de_cancer_relacionado_a_sindrome_de_cancer_de_mama_e_ovario_hereditaria_choice_sim_2o_grau_apenas_1_caso | historia_familiar_de_cancer_relacionado_a_sindrome_de_cancer_de_mama_e_ovario_hereditaria_choice_sim_2o_grau_mais_de_1_caso | idade_da_primeira_mentruacao | faz_uso_de_metodos_contraceptivo | qual_metodo_choice_pilula_anticoncepcional | qual_metodo_choice_diu | qual_metodo_choice_camisinha | qual_metodo_choice_outros | qual_metodo_choice_nao_informou | ja_fez_uso_de_drogas | atividade_fisica | consumo_de_tabaco | consumo_de_alcool | possui_historico_familiar_de_cancer | grau_de_parentesco_de_familiar_com_cancer_choice_primeiro_pais_irmaos_filhos | grau_de_parentesco_de_familiar_com_cancer_choice_segundo_avos_tios_e_netos | grau_de_parentesco_de_familiar_com_cancer_choice_terceiro_bisavos_tio_avos_primos_sobrinhos | regime_de_tratamento | hormonioterapia | data_da_cirurgia | tipo_de_terapia_anti_her2_neoadjuvante | radioterapia | data_de_inicio_do_tratamento_quimioterapia | esquema_de_hormonioterapia | data_do_inicio_hormonioterapia_adjuvante | data_de_inicio_da_radioterapia | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4262 | 82100 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2021-07-25 | Vivo, SOE | 366.0 | NaN | NaN | NaN | NaN | NaN | NaN | 24.0 | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | 11.0 | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Terapia Neoadjuvante | NaN | 2021-05-21 | Trastuzumabe | Sim | 2020-10-07 | NaN | NaN | 2021-07-16 |
| 4263 | 82111 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2021-08-14 | Vivo, SOE | 413.0 | Sim | NaN | NaN | 23.0 | NaN | Sim | 12.0 | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | 13.0 | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 4264 | 82112 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2021-10-04 | Vivo, SOE | 370.0 | Sim | NaN | NaN | 28.0 | NaN | Sim | 18.0 | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | 12.0 | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Terapia Neoadjuvante | NaN | 2021-06-07 | Trastuzumabe | Sim | 2020-11-23 | NaN | NaN | 2021-07-25 |
| 4265 | 82118 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2021-03-24 | Vivo, SOE | 391.0 | Sim | NaN | NaN | 21.0 | NaN | Sim | 18.0 | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | 15.0 | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Terapia Adjuvante | NaN | 2020-05-26 | Trastuzumabe | NaN | 2020-09-30 | NaN | NaN | NaN |
| 4266 | 82122 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2022-02-15 | Vivo, SOE | 589.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Terapia Adjuvante | Adjuvante | NaN | Trastuzumabe | Sim | 2020-12-03 | Inibidor de aromatase isolado | 2021-06-24 | 2021-07-01 |
| 4267 | 82123 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2021-10-25 | Vivo, SOE | 380.0 | NaN | NaN | NaN | NaN | NaN | Não | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | 12.0 | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Terapia Neoadjuvante | NaN | 2021-07-04 | Trastuzumabe | Sim | 2020-12-14 | NaN | NaN | 2021-10-03 |
| 4268 | 82124 | NaN | NaN | NaN | 41.0 | NaN | NaN | NaN | NaN | 2021-01-21 | Obito por câncer | 138.0 | Sim | NaN | NaN | 27.0 | NaN | Sim | 24.0 | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | 13.0 | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 4269 | 82131 | NaN | NaN | ENSINO MÉDIO | 59.0 | Feminino | NaN | NaN | NaN | 2022-06-10 | Obito por câncer | 900.0 | Sim | NaN | NaN | 26.0 | NaN | Sim | 7.0 | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | 12.0 | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | NaN | NaN | 2020-12-23 | NaN | Sim | NaN | NaN | NaN | 2021-04-10 |
| 4270 | 82205 | NaN | NaN | NaN | 29.0 | NaN | NaN | NaN | NaN | 2022-04-29 | Obito por câncer | 538.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | Sim | NaN | NaN | NaN | 2022-02-22 |
| 4271 | 82240 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2021-05-13 | Vivo, SOE | 425.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | Unchecked | Unchecked | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | NaN | Unchecked | Unchecked | Unchecked | NaN | NaN | NaN | NaN | Sim | NaN | NaN | NaN | 2021-01-12 |